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Abstract 



This paper completes reporting of the development of scales for measuring morale 
among Air Force personnel. A previously developed questionnaire keyed for eight scales 
was given to an independent sample. When these scores were compared with criteria de- 
rived from interviews, ratings, and Air Force records, validities were uniformly low. Al- 
though none of the validities are high enough for useful prediction, the scales do measure 
expressed attitudes with considerable reliability. 
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Validation of Morale and Attitude Scales* 



Th is is the last of four papers reporting outcomes of an investigation aimed at developing 
effective means for measuring morale among Air Force personnel. The first report of this 
series (Wnitloclc, 1960) reviewed the literature on the definition and measurement of morale and 
job satisfaction, with particular reference to theoretical considerations and to factor-analytic 
studies of job attitudes and morale. The second report (Cureton, I960) described the construc- 
tion of a 167-item questionnaire designed to measure all possible aspects of airman morale, and 
presented cluster-analysis and factor-analysis data from its administration to 1000 airmen at 
four Air Force bases. A third paper (Cureton & Sargent, I960) reported the factoring and refac- 
toring of a number of previous studies giving intercorrelations among job-attitude and morale 
scales, attempted to synthesize the results of these studies, and included notes on the special 
precautions required when small correlation matrices are factor-analyzed. 

On the basis of the data from the second report, a revised questionnaire was prepared. 
The present report gives the results of the administration of this revised questionnaire to a 
new sample of approximately 500 airmen. 

The Revised Questionnaire 

Construction 

The questionnaire used in the first tryout contained 167 questions. The second study of 
this series reported the development of 17 initial clusters. These clusters included 136 of the 
original 167 items. All items for which the factor analysis yielded communalities less than 
.10 werr omitted from the clusters. They either have low reliabilities or measure attitudes so 
specifi they are of little value as scale items. A number of additional items were omitted 

becaus of the clusters were large enough without them. 

The revised questionnaire consisted of these 136 items. They were rearranged into a 
new order such that no two from the same cluster were close together, and those from each 
cluster were fairly well distributed throughout the questionnaire. 

The Appendix relates the item numbers of Form A of the original questionnaire to the 
item numbers of the revised questionnaire. A copy of Form A is appended to the second re- 
port (Cureton, I960). 

Keying 

After the revised questionnaire was completed, it was found that the 17 original clusters 
were highly overlapping, nnd that several of them measured approximately what was measured 
by the original (167-item) questionnaire as a whole. 

From the overlap data and the data from a preliminary factor analysis of the 17 clusters, 
eight new cluster-scales were derived: 

1. Satisfaction with the Supervisor: 12 items. 

2. Satisfaction with the Air Force as a Military Organization: 7 items. 

3. Job Satisfaction: 8 items. 

4. Satisfaction with the Civilian Community and with the Attitudes of Civilians 

toward Airmen: 5 items. 



*Heleased by the author* for publication as a WADI) Technical" Report in March 1960. 
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5. Satisfaction with the Air Force as a Whole (i.e., with Air Force life in general): 

12 items. 

6. Satisfaction with Management and Communication (mainly the latter): 12 items. 

7. Satisfaction with the Unit and its ! eadership (co-workers and squadron officers 

and non-coms): 11 items. 

8. General Morale: 25 items measuring whatever was measured, on the average, 

by the original 167-item questionnaire as a whole. 

Scales 1-7 inclusive should be relatively independent of one another; i.e., their inter- 
correlations should he substantially lower than their readabilities, though they should all be 
positive and considerably above zero. Scales 1*4 inclusive should be relatively independent 
of Scale 8; Scales 5-7 inclusive should be too highly correlated with Scale 8 to be useful in 
a battery which includes Scale 8. Scale 2 should have relatively low reliability; the other 
seven should have satisfactory reliabilities. These statements are based on the data from 
the previous study, and subject to verification or refutation by the data of the present study. 

Scales 1-7 inclusive contain no common items. Scale 8 contains four items from Scale 
5, four from Scale 6, and two from Scale 7. Of the 136 items in the revised questionnaire* 
only 82 are scored, and ten of these p - scored on both Scale 8 and one other scale. 

The Sample 

The revised questionnaire was administered to 555 airmen. Of these, 44 were elimi- 
nated from further study because of inability to locate personnel records, and the question- 
naires of 31 were not available in time for the main analysis, leaving a total of 480 cases 
for the present analysis. 

The sample was selected by requesting from each squndron at one Air Force base a 
number of men proportional to the size of the squndron, nnd the squadrons were permitted to 
select the airmen wl.o were to participate. Airmen of the rank of master sergeant and above 
were excluded. 

The questionnaire was administered to groups of approximately twenty, with no group 
exceeding thirty. Prior to each session a slundard explanation of the project was given, 
after which the examiner read aloud the instructions on the cover page of the morale ques- 
tionnaire. The average time required was slightly under one hour. Administration began 
August 13, and the last stragglers were given the questionnaire on August 21, 1959. 

The Scales 

Frequency Distributions 

Table 1 gives the raw-score frequency distributions for the eight scales. For each 
item, responses could range from 1 (favorable attitude or high morale) to 5 (unfavorable 
attitude or low morale). The lowest possible score for each scale is the number of items 
in the scale, and the highest possible score is five times the number of items. Note that 
low scores indicate favorable attitudes or high morale. 
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All of the distributions are skewed toward the top, with the higher frequencies associated 
with the lower scores. This indicates that for this sample the attitudes measured are in general 
favorable, and the morale high. The skewness is particularly marked in the case of Scale 2 
(Military), indicating that these men regard the mission of the Air Force as important, its capa- 
bilities high, and their own contributions to the Air Force as a military organization substantial* 

Basic Statistical Data 

Table 2 gives the basic statistical data for the eight scales. The first complete row at the 
bottom contains entries 3n: three times the number of items in the scale. If each distribution 
were symmetric about the score half-way between the lowest possible score nnd the highest pos- 
sible score, the 3n value would be the mean. 

The next row gives the actual means. All of them are lower than the 3n values, indicating 
again that the mean attitudes tend in general to be favorable. The largest discrepancy is for 
Scale 2 (Military); this confirms the previous observation concerning the skewness of this scale. 

The following row contains the standard deviations. Here again the value for Scale 2 is 
conspicuously low, compared particularly with the value for Scale 4. Scale 2 has seven items; 
Scale 4 has only five. 

The next row contains the reliability coefficients, computed by the generalized Kuder- 
Richardson Formula 20 (generalized because the item-variances do not reduce to values of pq 
as they do for items scored 0 or 1). The reliability of Scale 2 is relatively low, as would be 
expected in view of its low standard deviation. Those of Scales 1 (Supervision) and 5 (Air 
Force as a Whole) are comparatively high for 12-item scales. That of Scale 8 (General Morale), 
with 25 items, is no higher. 

The last row contains the coefficients of reproducibility. In computing them we departed 
somewhat from Guttman's procedure. For each item, the five sets of response frequencies were 
distributed according to total scores. Response categories were not combined, and every cut- 
ting point was required to fall between two numerically different total scores. Guttman s re- 
quirement that the total error must not exceed the total non-error in any category of any item 
was followed rigidly; in several cases this increased the total error over what it would have 
been had the cut been so placed as to equalize the "above" error and the "below" error. The 
reported reproducibility of each scale is the mean reproducibility of its items. None of the 
scales even approach Guttman's criterion of homogeneity: .85 to .90. These scales are at 
best quasi-scales in his terminology, if they are even thnt. The rank correlation between the 
eight reliability coefficients and the eight corresponding coefficients of reproducibility is 
-.05. 

In the main body of the table, the first entry of each set of three is the raw correlation 
(product-moment) between the two scales. The second is the correlation corrected for attenu- 
ation. Since the items of each scale were distributed throughout the questionnaire, the 
scales were in effect administered simultaneously, and KH20 is an appropriate reliability 
coefficient to use in making the correction. The last entry of each set is the cosine of the 
angle between the cluster-centroid vectors of the two scales, based on the data of the pre- 
vious study (N ~ 1000), except that in column 8 it is the cosine of the angle between the 
cluster-centroid vector and the first centroid axis. This is almost, but not quite, the same as 
the cosine with the centroid of the cluster represented by Scale 8. The items of Scale 8 con- 
sisted of all those whose cosines with the first centroid axis were .90 or above, but its own 
centroid vector was not computed separately. 



The correlations corrected for attenuation approximate the cosines fairly well except for 
Scales 2 and 4. Scale 2 (Military) is much more nearly independent of the others than it was 
in the previous sample, and Scale 4 (Community) is somewhat more independent. The rank 
correlation between th^ 28 raw correlations and the cosines is .63; between the correlations 
corrected for attenuation and the cosines it is .64. 

Scales 5, 6, and 7 correlate highly with Scale 8. Using the correlations corrected for at- 
tenuation, the computed multiple correlation of Scales 4, 5, and 6 with Scale 8 is 1.02; the 
slight inconsistencies between the reliability coefficents and the intercorrelations, and the 
rounding errors in the two-digit entries overshadow any difference between the true multiple 
correlation, corrected for attenuation, and unity, A measure of "General Morale" just as good 
as that given by Scale 8 can be obtained by adding half the score on Scale 7 to the scores on 
Scales 5 and 6. 

Criterion Data 

Data from Official Records 

The investigators were granted access to the personnel records of the airmen designated 
to participate in the study. Approximately twenty items of information from the personnel fold- 
ers were recorded for each airman for whom records could be located. In addition, the Medical 
unit provided injury data and dispensary data (frequency of visits) on the airmen for whom 
they had records* The following items comprised the personnel data: 
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Name 
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Tbe administrative officer at the Base Hospital provided for each airman information on 
the number of times he had been to the hospital for injuries (initial visits) during the past 
year, and also the number of times (initial visits) he had visited the hospital for sickness 
during that period. 

Data from Personal Interviews 

Additional data were obtained through personal interview with the immediate super- 
visor of each airman tested. The interview was conducted in private, and at the beginning 
of each interview the nature and purpose of the project was explained. The supervisor was 
assured that no one in the Air Force would ever see his answers to the questions in the 
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interview. The supervisor was then presented with a sheet of paper on which was printed a 
graduated scale as follows: 

» | j J_ « i 

EXTREMELY VERY HIGH LOW VERY EXTREMELY 

HIGH HIGH LOW LOW 

He was asked to show the place on the scale that best described the morale of the group he 
supervised. Next he was asked to indicate which place on the scale best described the over- 
all level of job performance of his group. For either rating he was permitted to place his 
mark between "High" and "Low" if he ste£*»d that his group was too near average to be 
properly described by either of these terms. He was then asked these same two questions 
separately about each airman in his group who had completed the morale questionnaire. If 
there were two or more of his airmen participating, he was asked to rank them with respect to 
overall job performance. In cases where a supervisor had only one airman participating, he 
was questioned until a suitable reference group was established, and then requested to rank 
the participating airman with respect to this reference group. The supervisors were also 
asked to estimate the number of times each airman had asked to be excused from duty during 
the previous three-month period for rensons other than his own sickness. Finally, he was 
asked whether or not the airman was presently doing the kind of work for which he had been 
trained. 

Each interview took from five to twenty minutes, depending on the number of partici- 
pating airmen in the supervisor's unit. 

Quantification of Interview Data 

The morale and performance ratings were scored on a seven-point scale: 

1. Extremely high 

2. Very high 

3. High 

4. Average (neither high nor low) 

5. Low 

6. Very Low 

7. Extremely Low 

The performance rankings were scored so that 1 represented the top man in the group. 
These ranks were then converted into normalized scores (M ~ 500; a 100) using the 
Fisher-Yates tables. The "time-off-from duty" score was recorded as the number of ab- 
sences during the past three months as estimated from memory by the supervisor. These 
data were obtained as estimations by the supervisors because it was believed that the 
squndron records were not sufficiently uniform with respect to the recording of such ab- 
sences from duty. 

Validity Analyses 

The "General Morale" scale (Scale 8) was validated against all available record 
and interview criteria. For the ones deemed more important, all eight scales were 
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validated. Null hypotheses (hypotheses of zero validity) were tested in most cases by the F 
test, which was used instead of the t test even where the latter was applicable. This was 
done for simplicity of computation and uniformity of reporting. Where the criterion variables 
were essentially continuous, scale-criterion correlations were computed. 

For any given criterion item, there were in most cases some men for whom that item was 
not available. The number of cases varies, therefore, from one comparison to another. 

In interpreting scale means and mean differences, it should be noted again that a lower 
scale mean indicates a more favorable attitude or higher morale. 

Citations 

The mean scale scores for airmen who received one or more citations during the previous 
year and for those who did not are as follows: 

Received Gtotions No Citations 
Scole (N = 86) (N - 390) F_ 

1 28.65 28.48 

2 11.44 12.09 
'3 20.71 22.82 

4 13.44 14.27 

5 32.95 34.97 

6 29.89 30.48 

7 28.00 29.40 

8 65.21 67.53 

Scale 3 (Job Satisfaction) shows a difference significant at .01. Those who received citations 
like their job significantly better, but the actual mean difference is only 2.11 points or 31 per 
cent of the standard deviation (6.79) for the whole group of 480. 

Airman Performance Keports 

These are on a six-point scale with 6 the high rating. For the 295 of our 480 for whom 
they were available the distribution was as follows: 



.01 
2.57 
6.80 
3.06 
3.18 
■ .39 
2.19 
1.67 



.05 



.01 



3.86 



6.69 



Rotlng 1 2 3 4 5 6^ 

Frequency 0 1 26 122 129 17 

Combining the first fcur rating categories and the last two, the General Morale means are 
as follows: 

Rot I ng Cotegorles N 

01,02, 03, 04 146 

05, 06 149 

The difference is not significant; and such as it is, it associates higher morale with 
lower performance ratings. 



M 



64.16 
66.29 



F 

1.46 



F .05- 3 - 87 
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Proficiency Pay 

The General Morale means of those who did and did not receive proficiency pay are as 
follows: 

_N_ _M_ 

Received Proficiency Pay 14 66.94 

.11 F n . =3.86 

Did Not Receive 46 5 68.36 

The difference is insignificant. 

Injuries 

The correlations between the scales and number of initial visits to the Base Hospital 
for treatment of injuries during the previous year are as follows: 

Scale: 1234 5 *7q 



r: -°7 .07 .07 .23 .05 .03 .05 .06 

N 5=403 r 05 « .098 r 01 = .128 

Scale 4 (Community) has a correlation of .23, which is significantly greater than 0 beyond the 
.01 level, but not high enough for useful prediction. 

Dispensary Visits 

The correlations between the scales and the number of initial visits to the dispensary 
during the previous year are as follows: 



Scale: 1 



8 



•° 7 -.07 -.01 -.01 -.06 -.02 .03 .02 



N =403 r.05 = - 098 

None of the correlations are significantly different from 0 at the .05 level. 
Disciplinary Actions 

The mean scale scores of airmen who did and did not incur some disciplinary action 
during the previous year are as follows: 

Dliclplinary Actlon(i) No Dltclpllnary Action 

Scaly. (N = 191 (N = 447) F 

1 28.21 28.50 .00 

2. 11.10 11.93 1.09 

3 22.63 22.30 .00 

4 12.84 14.19 2.33 



5 32.53 34.65 .96 .05 ~ 3,86 

6 28.05 30.41 1.66 ~ 

7 29.10 29.06 .01 .01 " 6,69 

8 64.42 67.05 .54 
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While none of the mean differences is significant at the .05 level, it is interesting to note that 
six of the eight differences indicate higher morale on the part of those incurring some discipli- 
nary action. 

Job Performance (Supervisors' Ratings) 

The correlations between the eight scales and the supervisors' ratings of job performance 
are as follows: 

Scole: 1 2 3 4 5 6 7 8 

r: .16 .17 .20 .04 .14 .06 .13 .09 



N »395 r >05 = .098 r 01 = .128 

All of the scales except 4 (Community), 6 (Management-Communication), and 8 (General Morale) 
correlate significantly at the .01 level. For the three exceptions, the correlations are not sig- 
nificant at .05 either, though that of Scale 8 is almost significant. Despite statistical sig- 
nificance, the correlations are too low for useful prediction. That these correlations may be 
attenuated substantially by unreliability of the performance ratings is suggested by the fact 
that the correlation between these ratings and the performance rankings by the same supervisors 
is only .51. 

Rated Morale (Supervisors' Ratings) 
The correlations are as follows* 

Scale: 1 2 3 4 5 6 7 8 

r: .19 .23 .23 .02 .24 .17 .25 .15 



N *395 1 r 05 * .098 r 01 =.128 

All of the correlations except that with Scale 4 (Community) are significant at the .01 level or 
beyond, but none of them is high enough for much useful prediction. This may be due again, 
however, to unreliability of the ratings. 

Rated Group Morale (Supervisors* Ratings) 

Most of the ratings were in the above-average region, so it was necessary to combine 
categories to obtain substantial numbers in each. The mean General Morale scores for the 
combined categories were as follows: 

Group Morol e Rotlng 

"Extremely High" and "Very High" 
"High" 

"Middle," "Low," "Very Low," 
and "Extremely Low" 



52 64.46 

110 67.94 1.17 f\o5 =3 - 04 

47 68.98 
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The trend associates high General Morale with high ratings, but it is insignificant. The corre- 
lation between the 209 individual General Morale scores and the group ratings was .12, while 
r 05 »= .136 for N = 209. 

Absences (Supervisors' Estimates) 

The correlations between the eight scales and the supervisors' estimates of numbers of 
absences for reasons other than sickness of the men themselves are as follows: 



Scale: 



1 



3 



8 



.04 



N -391 



-.02 



.01 



.00 



.11 



-.01 



to " - 098 



.03 



.08 



None of the correlations is significantly different from 0 at the .05 level except Scale 5, which 
is just barely significant. Since this is one coefficient out of eight,' it too can be discounted. 

Compatibility of Work with Training 

The mean scale scores of the men who were and were not assigned to jobs for which they 
had been trained, according to supervisors' statements, are as follows: 





Trained 


Not Trained 




Scale 


(N - 335) 


(N - 55) 


F 


1 


27.80 


29.05 


.86 


2 


11.92 


11.98 


.01 


3 


22.01 


22.82 


.72 


4 


14.07 


14.73 


.14 


5 


34.66 


33.62 


.57 


6 


30.40 


29.76 


.32 


7 


28.93 


28.60 


.09 


8 


66.89 


65. c ,8 


.34 



F i05 - 3.87 



None of the differences is significant. 



Performance Rankings 

The correlations hetween the eight scale scores and the normalized supervisors' rank- 
ings on performance are as follows: 



Scale: 



8 



-.15 



-.23 



-.18 



-.03 



-.15 



-.12 



-.15 



-.17 



N -380 



.05 



.101 



.01 



.132 



All of the correlations except the one for Scale 4 (Community) are significantly different from 0, 
but again they are too low for useful prediction. The fact that they are all negative is due merely 
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to the scoring procedure: favorable attitudes tend to go with rankings indicating high per- 
formance. 



Military Rank 

The mean General Morale scores for men of various ranks are as follows: 



Rank N M 



A/B and A/3C 57 71,05 

A/2C 139 70.27 

A/1C 138 65.77 

S/Sgt 111 64.23 

T/Sgt 34 60.50 

N =479 F « 5.36 F nnnt . = 5.10 



The tendency for men of higher rank to have higher morale is highly significant. 
Size of Work Group 

The mean General Morale scores for men in work groups of various sizes are as 
follows: 



Group Size 


_N 


M 


2-3 


90 


66.13 


4 


96 


66.14 


5 


56 


63.9S 


6 


38 


71.18 


7 


24 


62.25 


8 


28 


75.18 


9 and over 


47 


65.09 


N -379 


F = 2.80 


F .05 = 2.13 



The differences as a whole are significant at the .05 level. The observed tendency for men 
in groups of 6 and 8 to have lower morale is not interpretable. 

Race 

The mean General Morale scores of White and Negro airmen are as follows: 



Race 


N 


M 


F 


White 


435 


67.68 










8.77 


Negro 


45 


60.58 





The morale of the Negroes is significantly higher than that of the Whites 



ERIC 



12 



Marital Status 

The mean General Morale scores by marital status are as follows: 



Moritol Status 


N 


M 


F 


Married 


298 


66.42 










1.15 


Not Married 


182 


67.98 





This relationship is not significant. 

Number of Dependents 

The mean General Morale scores of airmen having various numbers of dependents are 
as follows: 



Dependents 


N 


M 


None 


165 


68.58 


One 


131 


68.63 


Two 


76 


66.97 


Three 


68 


63.52 


Four or more 


40 


62.10 


N -480 


F « 2.65 





F .025 ~ 2 - 81 

There is a just significant tendency for higher morale to go with larger numbers of de- 
pendents. 

Age and Length of Service 

The correlations between scale scores and age, scale scores and length of service, 
and scale scores and length of service with age "held constant" by partial correlation are 
as follows: 



Scole 


1 


2 


3 


4 


5 


6 


7 


8 


r 12 


-.04 


-.25 


-.13 


-.13 


-.23 


-.15 


-.08 


-.18 


r 23 


-.01 


-.26 


-.10 


-.05 


-.25 


-.12 


-.04 


-.17 


r 12.3 




-.05 


-.09 


-.19 


-.02 


-.09 




-.07 


N -480 
'.05 = - 090 




Age « Var. 


1 

r .01 


Morole 
= .118 


= Var. 2 


Length 
r l3 = .89 


of Service 


- Var. 3 



The first row gives the correlations with age. All of them are significant except those for 
Scale 1 (Supervision) and Scale 7 (Unit). The second row gives the correlations with length 
of service. All of these are significant except those for Scale 1 (Supervision), Scale 4 
(Community), and Scale 7 (Unit). The third row gives the correlations with length of service 
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when age is partialled out. The Scale 3 (Job satisfaction) and Scale 6 (Management-Communica« 
tion) correlations are barely significant at the .05 level, but the Scale 4 (Community) correlation 
becomes significant at a level considerably beyond .01. 

Education 

The mean General Morale scores for airmen having various amounts of education are 
as follows: 



Education 


N 


M 


Grammar School or less 


60 


66.37 


Some High School 


145 


66.16 


High School graduate 


233 


68.23 


At least some College 


34 


64.00 


N = 472 F = 1.08 




F 05 -3.14 



There is no significant relationship between morale and amount of education. 

Aptitude Test Scores 

The correlations between the General Morale scale and four aptitude indexes of the 
Airman Classification Battery are as follows: 



Scale 



Al 


1 


2 


3 


4 


5 


6 


7 


8 


N 


Technician 


-.11 


->04 


.00 


.01 


.05 


.09 


-.04 


.03 


279 


Mechanical 


-.02 


-.04 


.00 


.00 


.08 


.13* 


.03 


.07 


283 


Clerical 


->\2* 


-.03 


.03 


.02 


.02 


.07 


-.03 


.01 


282 


Electronics 


-.08 


-.03 


-.01 


.03 


.10 


.12 


-.02 


.07 


213 



'Significant at # 0S level: r ^ = .118 for N = 282 

In general, the correlations are not significant. The two which are barely significant at the 
•05 level may be discounted. 



Airman Proficiency Tests 

As part of qualifying for change of AFSC to the next higher level in a career area, an 
airman takes a job knowledge test, the Airman Proficiency Test for his particular specialty. 
The correlations between the eight scales and scores on the Airman Proficiency Tests are 
as fellows: 

ScoL: 1 2 3 4 5 6 7 8 

r: -.05 -.03 .04 -.01 -.03 -.05 -.06 -.05 



N -343 r >05 - .107 

None of these correlations is significant at the .05 level, though all but one have the sign 
associated with a positive relationship between test scores and morale scores. 
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Assignment and Career Field 

The mean General Morale scores for those whose Primary AFSC and Duty AFSC agree 
completely, agree partially, and do not agree are as follows: 



Agrssmsnt N M 

Complete 367 67.08 

First 3 digits 70 64.20 

None 43 71.02 



N-480 F«2.63 F <05 «3.01 

The relationship is not significant, and the trend is not consistent. 

The mean General Morale scores for those whose Control AFSC and Duty AFSC agree 
completely, agree partially, and do not agree are as follows: 



Agreement 


N 


M 


Complete 


413 


66.98 


First 3 digits 


53 


66.40 


None 


14 


70.71 


N =480 


F = .49 


F 05 = 3.01 



Here again there is no significant trend. 

Length of Time in Squadron 

The correlations between the eight scale scores and length of time in squadron ore 
as follows: 

Scolr. 1 2 3 4 5 6 7 8 

n -.04 ^05 14 .08 -.04 .00 13 -.05 

N=480 r .os"' 090 r .01= - 118 

The correlations with Scale 3 (JoL satisfaction) and Scale 7 (Unit) are significant at the 
.01 level. None of the others is significant at the .05 level. 

Overall Value to the Air Force 

A few squaurons were asked to schedule for administration of the questionnaire their 
three or four least desirable airmen: the ones whose potentialities were least from the stand* 
point of the Air Force. Of the airmen selected for this sample, it was possible to match 47 
with 47 other airmen who were given the questionnaire along with the regular sample. It was 
desired to match on age, race, rank, years of service, education, marital status and Primary 
AFSC. This matching was achieved with minor exceptions. The matched-pair t test was 
applied to test the significance of the mean difference on each of the eight scales. The re- 
sults appear on the next page. 
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Scale: 12 3 4 5 6 7 8 



t: 1.97 1.99 1.05 -.48 .90 .81 2.68 1.40 



N » 47 pairs t Q5 « 2.01; ' 01 = 2,69 (two * side ^ 

For all scales except Scale 4 (Community), the "special group" had lower morale. However, 
only in the case of Scale 7 (I ' n i t) was the difference significant beyond the .05 level. 

Summary 

1. Scale 2 (Military) has n low mean, n lew standard deviation, a low reliability, and 
low correlations with the other scales. The airmen of this sample express such uniformly 
high satisfaction with the Air Force ns a military organization that the variability of their 
scores is depressed along with the mean, and this in turn lowers the reliability and the 
correlations with the other scales. This is the only scale on which the members ot this 
sample show a radically different average altitude from that shown by the members of the 
initial sample. 

2. With the exception of Scale 2, the eight scales all have satisfactory reliabilities. 
The reliability of Scale 4 (Community) is only .70, hut this scale has only five items. The 
other six scales all have reliabilities above .80. 

3. The intercorrelations among Scales 1-7 inclusive are all substantially lower than 
their reliabilities. Scales 5, 6, and 7 correlate highly with Scale 8. Weighted 1, 1, and .5, 
they fonn together just as good a measure of General Morale as does Scale 8. 

4. The reproducibilities are all in the neighborhood of ,5. In Guttman's terms these 
scales are at best quasi-scales. All reproducibilities were computed for the 5-category 
scales; categories were not combined for the purpose of increasing reproducibility. The 
factorial homogeneity of the scales had been demonstrated in a previous study. 

5. Against the criteria here employed, the validities are uniformly low. In a number 
of cases these validities are significantly different from zero at low probability levels, btit 
the actual correlations and mean differences are too low for useful prediction. In the case 
of the supervisors' ratings, the suspicion that low validities may be due in considerable 
part to low criterion reliability is supported by a correlation between ratings and rankings 
of the same airmen by the same supervisors on the same criterion variable (production) of 
only .51. 

Interpretations and Conclusions 

Scaling 

In the previous study, the primary criterion for the allocation of items to scales was 
factorial homogeneity. A set of iteais is factorially homogeneous if every item of the set 
has approximately the same factorial structure. The inter-item correlations should yield 
a Spearman hierarchy: one single general factor plus a unique factor for each item — the 
latter to be interpreted as error of measurement. It has been shown (Cureton, 1958) that 
factorial homogeneity is a necessary condition for the applicability of the generalized 
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Kuder-ltichardson Formula 20 as a measure of the internal consistency ("instantaneous" 
reliability) of the scale. A secondary criterion was reasonably high communality, since the 
communality of an item is a close lower bound for ihe item's reliability. In the following 
brief tabulation, r n is an estimate of |l s * reliability of each of the eight sca'es obtained by 
taking t*\e mean communality from the previous study (N « 1000) as an estimate of the reli- 
ability of one item and raising by the Spearman-Hrown formula. KR20 is the generalized 
Kuder-Richardson formula 20 reliability as actually computed from the data of the present 
study (next-to-last row of Table 2). 



ScoU: 


1 


2 


3 


4 


5 


6 


7 


8 


r n ! 


.94 


.69 


.86 


.72 


.93 


.89 


.88 


.95 


KR20: 


.92 


.65 


.85 


.70 


.91 


.84 


.86 


.92 


Diff: 


.02 


.04 


.01 


.02 


.02 


.05 


.02 


'13 



In every case r n exceeds KR20, though theory says r n is a lower bound. The reason is 
probably that the previous sample was somewhat more heterogeneous, and hence more 
variable, than the present sample. In any event, the agreement is striking. The largest 
difference is .05, the mean difference ia .03, and the rank correlation between the eight 
pairs of values is .92. 

The items of a perfect Guttman scale form factorialiy a perfect simplex, which has- 
common factors if n is even, and ^ (n-1) if n is odd (DuHois, Manning, & Spies, 1959, 
pp. 8-23). It follows that if the items of a scale agree perfectly or almost perfectly in par- 
tially ranking the subjects, the scale c^not be factorialiy homogeneous, and the Kuder* 
Richardson formula 20 is not an experimentally unbiased estimate of its internal consis- 
tency. Interns! consistency and Guttman scalability are mutually incompatible criteria 
for the choice of items to form a scale. 

Which criterion is "better" cannot be answered until we can agree upon a criterion 
of "goodness.** No such criterion has as yet been proposed. We may note, however, that 
Mahoney (1956), who did find substantial correlations between morale-scale scores and 
behavioral indices, used Guttman's methods in the construction of his scales. We chose 
internal consistency (factorial homogeneity) as the criterion, because it ia clearly the 
appropriate criterion for the formation of scales to be used as variables in factor-analytic 
studies of the inter*scaie correlations. This choice may well explain why our scales have 
good Kuder-Richardson-20 reliabilities but low coefficients of reproducibility. The use by 
other writers of Guttman scalability as a criterion for scale construction may also explain 
in part why re-factoring of their inter-scale correlations with iteration of the initial factor- 
ing procedure to produce stable communalities seems not too infrequently (e.g., in the case 
of Mahoney's data) to lead to a Hey wood case, with one communality converging to a value 
greater than unity. If the number of items in each of several perfect Guttman scales ex- 
ceeds the number of scales used in a factor analysis, the minimum number of factors re- 
quired by the scale criteria will exceed the maximum number which can be extracted from 
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that number of »cales« Whatever their practical values, Cuttman acelea are not the type of 
variables suitable for fsctor analysis. Guttmsn presumably recognized thia in proposing ths 
aubstitution of simplex, circumple:;, and rsdex snslysis for fsctor anaiyaia. 

Validity 

The dilemma which haa plagued inveatigators in the sres of morsle for some time is 
well Illustrated in the present study. If one clsims thst the morsle scsles are sll thst they 
should be, then one must conclude thst the criterion messures sre either unrelisble or non- 
relevsnU If it is assumed thst the criterion messures are relevant and worlcobly reliable, 
then it muat be concluded that the morale acalea are not valid* 

In the paat most investigators hsve been content to draw one or ia* other of these con- 
clusions (seldom both), and proceed ss though the assumption remslned tenable concerning the 
inherent poaitive relationahip between exprened morale and behavioral continua representing 
deairable organizational behavior. The preaent investigstors depcrt from this tradition, Th* 
following ciaima are made: 

1. The measurement chsrscteristics of the morsle scsles developed in the present ctudy 
meet the technical requirements of good assuring devicea. The items sre judged to measure 
expressed morale* 

2. The relevsnce of the criterion measures to orgsrusstionsl objectives is judged high. 
In view of the low correlations between the morale scsles snd the criterion messures, 

the conclusion is drawn thst the relationship between expressed morale siid the selected 
messi cs of organizational behavior is zero to negligible* 

Some question msy then srise ss to the usefulness of expresseo norsle messures, if 
such messures hsve no predictive efficiency for behsvioral criteria. Such a queation is not 
sppropriste since in some studies, morsle scales have hsd such predictive power (Mfchoney, 
1956). The sppropriste question is, therefore, "under whst condition will morsle measurca 
be related to performance measures?" Msrquis, et al. (1951) hsve suggested thst low morsle 
snd high productivity will occur together under the conditions of: (l) clearly-defined eingle 
goals, (2) clesr-cut division of Isbor, (3) necesssry skills well-known snd possessed by 
group, snd (4) strong pressures from outside the group itself towsrd conformity. 

Another question thst arises concerns the "validk/" of morsle scale messures inde- 
pendent of their correlation* with sny external criteris. This question reduces to the ques- 
tion of the absolute desirsbility of high expressed morsle. While this is ususlly « ;natter 
of socisl definition, it is generslly sssumed thst high expressed morsle is "good. Certain- 
ly, the condition of high expreased job sstisfsction, slong with expressed feelings of personal 
gosl fulfillment is consonsnt with the objectives of those who sre concerned with the welfire 
of the individual in society. The question of whether or not the respondents in morsle studie 
indicate their "true" or "real" feePugs is meaningless, since operstions do not exist for the 
determination of this kind of "validity." 

The above results snd conclusions need not be too disconcerting to thoie who hsve 
specislized in morsle messurement. Stogdill (1959) relegates "morale" to a le*el of impor- 
tance equal to that of the "productivity" and "integration," which three outcomes repress* 
the dimensions of group schievement. Thus "morsle" is conceived ss logicslly independent 
of both integration (of the group) and productivity. All three represent group accomplishment 
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and represent the effects of transformation through group structure of the three basic group- 
input variables: performances, interactions, and expectations. From the standpoint of the 
present study, Stogdill's theoretical formulation is significant in that morale is assigned to 
the output end of the input-output model. This has the effect of changing radically the direc- 
tion of morale research. Instead of designing studies in which morale is considered an inter* 
vening variable, or even a cause, in the cause-effect model, it is now suggested that morale 
studies be designed from the standpoint of attempting to delineate the conditions under 
which morale is related to other group outcomes. This in turn has the effect of putting on 
equal theoretical par those studies in which expressed morale measures are discovered to 
relate only to one another and those in which such morale measures are discovered to corre- 
late substantially with other variables. 
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Appendix 

hem Numbers Identifying the Same Item in Form A 
and Revised Questionnaires 



Form A 


R •vised 


rorm A 


Revised 


1 


0 


39 


66 


9 
•* 


17 


40 


40 


3 


a 

Q 


A 1 

41 


79 


4 




A O 

4o 


52 


5 


o 

o 


A A 

44 


27 


6 


36 


AK 

45 


10 


7 


16 


A \ 

4o 


15 


8 


47 


47 


42 


9 


ii 


49 


49 


10 


22 




44 


12 


6 


53 


59 


13 


7 


54 


28 


14 


5 


55 


33 


15 


20 


57 


70 


16 


12 


58 


51 


17 


32 


60 


23 


18 


14 


61 


103 


19 


38 


62 


62 


20 


54 


63 


21 


21 


45 


64 


82 


22 


26 


65 


61 


23 


4 


66 


35 


24 


1 


67 


50 


25 


25 


68 


39 


26 


65 


69 


41 


27 


90 


70 


60 


28 


If? 


71 


108 


29 


5" 


72 


100 


30 


24 


73 


115 


31 


43 


74 


89 


33 


2 


75 


67 


34 


13 


76 


34 


35 


55 


77 


68 


38 


37 


78 


73 



Form A 


RtvUtd 


Form A 


Rtvlstd 


79 


84 


123 


106 


80 


74 


125 


114 


81 


46 


126 


105 


82 


92 


127 


31 


83 


63 


128 


102 


85 


85 


129 


56 


86 


78 


130 


107 


89 


96 


131 


112 


90 


77 


132 


122 


92 


58 


133 


132 


93 


53 


134 


117 


94 


69 


135 


95 


95 


30 


136 


118 


96 


86 


137 


129 


08 


81 


138 


93 


99 


19 


139 


121 


100 


64 


140 


91 


102 


75 
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